Design Issues in Automatic Grapheme-to-Phoneme Conversion for Standard Yorùbá

نویسندگان

  • Abímbólá R. Ìyàndá
  • Odétùnjí Àjàdí Odéjobí
چکیده

Grapheme-to-Phoneme (G2P) conversion is an important problem in Human Language Processing development, particularly Textto-Speech (TTS). Its primary goal is to accurately compute the pronunciation of words in the input texts. This work examines design issues with respect to components of the automatic G2P for standard Yorùbá (SY). The automatic process includes: (i) Tokenisation of Input, (ii) Identification of nasal characters, (iii) Syllabification, and (iv) Conversion of Graphemes to Phonemes. The structure of the Yoruba text is described and a text corpus design for standard Yorùbá TTS is presented. The analysis of the data was done using Zipf’s law. The outcome of this work provided adequate requirement for the design.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Decision Tree Learning for Automatic Grapheme to Phoneme Conversion for Tamil N.Udhyakumar, C.S.Kumar, R.Srinivasan and R.Swaminathan

This paper describes a novel approach for grapheme to phoneme conversion using decision tree learning technique. The proposed approach, unlike the rule based approach, can generate rules spanning wider context and thus give better accuracy for the conversion.

متن کامل

A Novel Approach to Unsupervised Grapheme–to–phoneme Conversion

Automatic, data-driven grapheme-to-phoneme conversion is a challenging but often necessary task. The top-down strategy implicitly adopted by traditional inductive learning techniques tends to dismiss relevant contexts when they have been seen too infrequently in the training data. This paper proposes instead a bottom-up approach which, by design, exhibits better generalization properties. For e...

متن کامل

Probabilistic Context-Free Grammars for Syllabification and Grapheme-to-Phoneme Conversion

We investigated the applicability of probabilistic context-free grammars to syllabi cation and grapheme-to-phoneme conversion. The results show that the standard probability model of context-free grammars performs very well in predicting syllable boundaries. However, our results indicate that the standard probability model does not solve grapheme-to-phoneme conversion su ciently although, we va...

متن کامل

Grapheme to phoneme conversion using an SMT system

This paper presents an automatic grapheme to phoneme conversion system that uses statistical machine translation techniques provided by the Moses Toolkit. The generated word pronunciations are employed in the dictionary of an automatic speech recognition system and evaluated using the ESTER 2 French broadcast news corpus. Grapheme to phoneme conversion based on Moses is compared to two other me...

متن کامل

Solving the Phoneme Conflict in Grapheme-to-Phoneme Conversion Using a Two-Stage Neural Network-Based Approach

To achieve high quality output speech synthesis systems, data-driven grapheme-to-phoneme (G2P) conversion is usually used to generate the phonetic transcription of out-of-vocabulary (OOV) words. To improve the performance of G2P conversion, this paper deals with the problem of conflicting phonemes, where an input grapheme can, in the same context, produce many possible output phonemes at the sa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Research in Computing Science

دوره 90  شماره 

صفحات  -

تاریخ انتشار 2015